88 research outputs found

High Order Volumetric Directional Pattern for Video-Based Face Recognition

Authors: Asari Vijayan; Essa Almabrok
Publication venue: EngagedScholarship@CSU
Publication date: 01/01/2019

Describing dynamic textures has attracted growing attention in computer vision and pattern recognition. In this paper, a novel approach for recognizing dynamic textures, namely the high order volumetric directional pattern (HOVDP), is proposed. It is an extension of the volumetric directional pattern (VDP), which extracts and fuses the temporal information (dynamic features) from three consecutive frames. HOVDP combines movement and appearance features by considering the nth order volumetric directional variation patterns of all neighboring pixels from three consecutive frames. In experiments with two challenging video face databases, YouTube Celebrities and Honda/UCSD, HOVDP clearly outperformed a set of state-of-the-art approaches.

Cleveland-Marshall College of Law

Video-to-Video Pose and Expression Invariant Face Recognition using Volumetric Directional Pattern

Authors: Asari Vijayan K.; Essa Almabrok
Publication venue: eCommons
Publication date: 01/03/2015

Face recognition in video has attracted attention as a cryptic method of human identification in surveillance systems. In this paper, we propose an end-to-end video face recognition system that addresses the difficult problem of identifying human faces in video in the presence of large variations in facial pose and expression and poor video resolution. The proposed descriptor, named Volumetric Directional Pattern (VDP), is an oriented, multi-scale volumetric descriptor that extracts and fuses the information of multiple frames, temporal (dynamic) information, and multiple poses and expressions of faces in the input video to produce feature vectors, which are then matched against all the videos in the database. To keep the approach computationally simple and easy to extend, a key-frame extraction method is employed, so that only the frames containing important information are processed further instead of every frame in the video. The proposed VDP algorithm is evaluated on a publicly available database (the YouTube Celebrities dataset), and promising recognition rates are observed.

University of Dayton

Histogram of Oriented Phase (HOP): A New Descriptor Based on Phase Congruency

Authors: Asari Vijayan K.; Ragb Hussin
Publication venue: eCommons
Publication date: 01/05/2016

In this paper we present a low-level image descriptor called Histogram of Oriented Phase, based on the phase congruency concept and Principal Component Analysis (PCA). Since the phase of a signal conveys more information about signal structure than the magnitude, the proposed descriptor can identify and localize image features more precisely than gradient-based techniques, especially in regions affected by illumination changes. The proposed features are formed by extracting the phase congruency information for each pixel in the image with respect to its neighborhood. Histograms of the phase congruency values of the local regions in the image are computed with respect to their orientation. These histograms are concatenated to construct the Histogram of Oriented Phase (HOP) features. The dimensionality of the HOP features is reduced using the PCA algorithm to form the HOP-PCA descriptor. Because phase congruency is a dimensionless quantity, the HOP-PCA descriptor is more robust to image scale variations as well as contrast and illumination changes. Several experiments were performed using the INRIA and DaimlerChrysler datasets to evaluate the performance of the HOP-PCA descriptor. The experimental results show that the proposed descriptor has better detection performance and lower error rates than a set of state-of-the-art feature extraction methodologies.

University of Dayton
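
The histogram-and-concatenate step described in the HOP abstract above can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes a per-pixel strength map (for example, a precomputed phase congruency map) and a per-pixel orientation map are already available, and the function names, the 8-pixel cell size, and the 9 orientation bins are hypothetical choices made only for the example.

```python
import numpy as np

def oriented_histograms(strength, orientation, cell=8, bins=9):
    """Concatenate per-cell orientation histograms weighted by `strength`.

    strength    : 2-D array, e.g. a phase congruency map (assumed precomputed)
    orientation : 2-D array of angles in radians, folded into [0, pi)
    cell, bins  : illustrative values, not taken from the paper
    """
    h, w = strength.shape
    rows, cols = h // cell, w // cell
    # Map every angle to a bin index in 0..bins-1.
    idx = np.floor((orientation % np.pi) / np.pi * bins).astype(int)
    idx = np.minimum(idx, bins - 1)  # guard against rounding up to `bins`
    feats = np.zeros((rows, cols, bins))
    for r in range(rows):
        for c in range(cols):
            block = (slice(r * cell, (r + 1) * cell),
                     slice(c * cell, (c + 1) * cell))
            # Each pixel votes for its orientation bin with its strength.
            feats[r, c] = np.bincount(idx[block].ravel(),
                                      weights=strength[block].ravel(),
                                      minlength=bins)
    return feats.ravel()

def pca_reduce(X, k):
    """Project the rows of X (one descriptor per row) onto the top-k principal components."""
    Xc = X - X.mean(axis=0)
    _, _, vt = np.linalg.svd(Xc, full_matrices=False)
    return Xc @ vt[:k].T
```

For a 16 x 16 input with the default settings, the descriptor has 2 x 2 cells x 9 bins = 36 entries; stacking one descriptor per image into a matrix and calling `pca_reduce` gives the reduced representation.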

Dense Point-Cloud Representation of a Scene using Monocular Vision

Authors: Asari Vijayan K.; Diskin Yakov
Publication venue: eCommons
Publication date: 01/03/2015

We present a three-dimensional (3-D) reconstruction system designed to support various autonomous navigation applications. The system focuses on the 3-D reconstruction of a scene using only a single moving camera. Utilizing video frames captured at different points in time allows us to determine the depths of a scene. In this way, the system can be used to construct a point-cloud model of its unknown surroundings. We present the step-by-step methodology and analysis used in developing the 3-D reconstruction technique. We present a reconstruction framework that generates a primitive point cloud, which is computed based on feature matching and depth triangulation analysis. To populate the reconstruction, we utilized optical flow features to create an extremely dense representation model. As the third algorithmic modification, we add a preprocessing step of nonlinear single-image super resolution. With this addition, the depth accuracy of the point cloud, which relies on precise disparity measurement, has significantly increased. Our final contribution is an additional postprocessing step designed to filter noise points and mismatched features, unveiling the complete dense point-cloud representation (DPR) technique. We measure the success of DPR by evaluating the visual appeal, density, accuracy, and computational expense, and compare it with two state-of-the-art techniques.

University of Dayton
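
The dependence of depth on disparity mentioned in the abstract above can be illustrated with the standard triangulation relation Z = f * B / d. The sketch below is not the authors' pipeline: it assumes an idealized, rectified pair of views, with the focal length given in pixels and the camera displacement between the two frames acting as the baseline. The function name and parameters are hypothetical.

```python
import numpy as np

def depth_from_disparity(disparity_px, focal_px, baseline_m):
    """Triangulated depth Z = f * B / d for a rectified pair of views.

    disparity_px : pixel disparities of matched features
    focal_px     : focal length in pixels (assumed known from calibration)
    baseline_m   : camera displacement between the two frames, in meters
    Points with non-positive disparity are assigned infinite depth.
    """
    d = np.atleast_1d(np.asarray(disparity_px, dtype=float))
    depth = np.full(d.shape, np.inf)
    valid = d > 0
    depth[valid] = focal_px * baseline_m / d[valid]
    return depth
```

Under this relation a small disparity error Δd changes the depth by roughly Z² / (f * B) * Δd, so the error grows with the square of the distance. This is consistent with the abstract's remark that depth accuracy relies on precise disparity measurement.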

Histogram of Oriented Phase and Gradient (HOPG) Descriptor for Improved Pedestrian Detection

Authors: Asari Vijayan K.; Ragb Hussin
Publication venue: eCommons
Publication date: 01/02/2016

This paper presents a new pedestrian detection descriptor named Histogram of Oriented Phase and Gradient (HOPG), based on a combination of the Histogram of Oriented Phase (HOP) features and the Histogram of Oriented Gradient (HOG) features. The proposed descriptor extracts the image information using both the gradient and phase congruency concepts. Although the HOG-based method has been widely used in human detection systems, it does not deal effectively with images affected by illumination variations and cluttered backgrounds. By fusing HOP and HOG features, more structural information can be identified and localized, yielding descriptors that are more robust and less sensitive to lighting variations. The phase congruency information and the gradient of each pixel in the image are extracted with respect to its neighborhood. Histograms of the phase congruency and the gradients of the local segments in the image are computed with respect to their orientations. These histograms are concatenated to construct the HOPG descriptor. The proposed descriptor was evaluated using the INRIA and DaimlerChrysler datasets. A linear support vector machine (SVM) classifier is trained to detect pedestrians. The experimental results show that the human detection system based on the proposed features has lower error rates and better detection performance than a set of state-of-the-art feature extraction methodologies.

University of Dayton
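
The fusion by concatenation described in the HOPG abstract above, followed by a linear classifier, can be sketched as below. This is only an illustration under stated assumptions: the L2 normalization of each descriptor before concatenation is an assumption made here so that neither part dominates, not a detail given in the abstract, and the weights `w` and bias `b` stand in for a linear SVM that has already been trained. All names are hypothetical.

```python
import numpy as np

def l2_normalize(v, eps=1e-12):
    """Scale a descriptor to unit length (eps avoids division by zero)."""
    v = np.asarray(v, dtype=float)
    return v / (np.linalg.norm(v) + eps)

def fuse_descriptors(hop, hog):
    """Concatenate a phase-based and a gradient-based descriptor.

    Normalizing each part first is an assumption of this sketch.
    """
    return np.concatenate([l2_normalize(hop), l2_normalize(hog)])

def linear_svm_score(x, w, b):
    """Decision value of a trained linear SVM; positive means 'pedestrian' here."""
    return float(np.dot(w, x) + b)
```

A window would then be classified by the sign of `linear_svm_score(fuse_descriptors(hop, hog), w, b)`.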

Person Identification from Streaming Surveillance Video using Mid-Level Features from Joint Action-Pose Distribution

Authors: Asari Vijayan K.; Nair Binu M.
Publication venue: eCommons
Publication date: 01/02/2015

We propose a real-time person identification algorithm for surveillance-based scenarios from low-resolution streaming video, based on mid-level features extracted from the joint distribution of various types of human actions and human poses. The proposed algorithm combines an auto-encoder-based action association framework, which produces per-frame probability estimates of the action being performed, with a pose recognition framework, which gives per-frame body part locations. The main focus of this manuscript is to effectively combine these per-frame action probability estimates and pose trajectories from a short temporal window to obtain mid-level features. We demonstrate that these mid-level features capture the variation in the action performed with respect to an individual and can be used to distinguish one person from the next. A preliminary analysis on the KTH action dataset, where each sequence is annotated with a specific person and a specific action, is provided and shows some interesting results that verify this concept.

University of Dayton